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ABSTRACT 

We use the Millennium Galaxy Catalogue (MGC) to study the effect of compact galaxies 
on the local field galaxy luminosity function. Here we observationally define as 'compact' 
galaxies that are too small to be reliably distinguished from stars using a standard star-galaxy 
separation technique. In particular, we estimate the fraction of galaxies that are misclassified 
as stars due to their compactness. 

We have spectroscopically identified all objects to B^gc = 20 mag in a 1.14 deg^ sub- 
region of the MGC, regardless of morphology. From these data we develop a model of the 
high surface brightness incompleteness and estimate that '^ 1 per cent of galaxies with Bmgc < 
20 mag are misclassified as stars, with an upper limit of 2.3 per cent at 95 per cent confidence. 
However, since the missing galaxies are preferentially sub-L* their effect on the faint end of 
the luminosity function is substantially amplified: we find that they contribute '^6 per cent 
to the total luminosity function in the range — 17 < Mb < — 14 mag, which raises the faint 
end slope a by O.OS^qq^. Their contribution to the total i?-band luminosity density is ^2 per 
cent. Roughly half of the missing galaxies have already been recovered through spectroscopy 
of morphologically stellar targets selected mainly by colour We find that the missing galaxies 
mostly consist of intrinsically small, blue, sta r-forming, su h-L* objects. 

In combination with the recent results of lDriver et alJ C200 5) we have now demonstrated 
that the MGC is free from both high and low surface brightness selection bias for giant galax- 
ies (Mb ^ — 17 mag). Dwarf galaxies, on the other hand, are significantly affected by these 
selection effects. To gain a complete view of the dwarf population will require both deeper 
and higher resolution surveys. 

Key words: surveys - galaxies: statistics - galaxies: fundamental parameters. 



1 INTRODUCTION 

The idea that galaxies at a given luminosity span a wide range 
in surface brightness (SB) and that selection effects might cause 
both high and low SB galaxies to be under-represented in the ob- 
served galaxy distribution goes back to at le ast Rea ves ( 1956!); 
IZwickvl J195% : IateI ITJel and lPisnevI Jl976h . Low SB galaxies 
are d ifficult to detect be cause of their low contrast against the sky 
ilmp ev & Bothug[l993) while high SB galaxies may be confused 
with stars. Previously overlooked types of galaxies have indeed 
been uncovered over tim e, such as the giant low SB galaxy Ma- 
Un 1 JBothuii et al ll987|) and several types of blue compact galax- 
ies iZwickvlll96a : see iKunth & OstlinllZOOCI for a recent review) 
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at various luminosities and redshifts, including Hll galaxies (e.g. 
Sareent c& Searle 1970; Telles et al. 1997), blue compact dwarfs 
(e.g. Thuan c& Martin 1981; Cairos et al. 2001) and luminous 
compact blue galaxies ( Phillips et al. 1997 ; Guzman et al. 2003|) 
which in turn include the com pact narrow emission line galaxies 
(CNELGs) of lKoo et alj>1994il99l ; lGuzman et aljri996i.il99^ . 
Most recently a potentially new class of faint and compact objects, 
the so-called ultra compact dw arfs (UCDs), was discovered in the 
Fornax iHilker et aL. 1999.; ■Drinkwateretal^.200Q. .2003.; see also 
iMieske. Hilker & Infantell2004. Abell 1689 JMieske et alj|2004h 
and Virgo clusters JHasegan et all2005l) . 

The extent to which a given survey misses these sorts of 
galaxies depends on the survey's detailed selection limits in the 
luminosity-SB plane (LSP). These in turn are determined by the 
survey's apparent magnitude, SB and size limits, i.e. by its depth 
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and resolution. Recently, IPriver et alj J2003) presented the bivari- 
ate brig htness distributio n of the Millennium Galaxy Catalogue 
("MGCi lLiske et alJl2003h . i.e. the space density of galaxies in the 
LSP. They used simulations to determine the MGC's selection lim- 
its and found that the observed galaxy distribution was well sepa- 
rated from both the low and high SB selection limits for luminosi- 
ties brighter than at least Mb ~ — 18 mag (see their fig. 12). Hence 
SB selection effects are not a significant issue in the MGC at these 
luminosities (unless the SB distribution of galaxies is bimodal) and 
in that sense the bright end of MGC's luminosity function (LF) is 
truly 'global'. The goal is now to obtain a similarly complete pic- 
ture at lower luminosities. 

To probe the galaxies beyond the MGC's low SB limit requires 
deeper imaging and spectroscopy on 8-m class telescopes. 

In this paper we will instead address the question of what lies 
beyond the MGC's high SB selection limit. In particular, what is the 
effect of the compact galaxies beyond this limit on the local field 
galaxy LF and what is their contribution to the luminosity density? 
To answer these questions we must first understand and quantify 
the MGC's incompleteness due to some compact galaxies having 
been misclassified as stars from the imaging data. To this end we 
have assembled two datasets: 

First, we conducted an all-object spectroscopic survey in the 
range 16 ^ Bmgc < 20 mag over a 1.14 deg^ sub-region of the 
MGC, targeting all detected objects in this magnitude range, re- 
gardless of classification, morphology, colour or any other proper- 
ties. This complete sample will be used to determine the fraction of 
galaxies that have been misclassified due to their compact nature. 

A few such all-object su r veys have been performed previ- 
ously: iMorton. Krug & TrittonI il985l) observ ed all 606 objects to 
B < 20 mag in a 0.31 deg^ region while 'CoUess et al.' f 1990", 



2 DATA 

2.1 The MGC 



119911) investigated a random sample of 266 objects in the range 
21 ^ b,j < 23.5 mag. Most recently, Drinkwat er et alj <2000h 
described an ambitious project to observe 14 000 objects with 
16.5 ^ 6j ^ 19.7 mag in a 12 deg'^ region centered on the Fornax 
cluster, which led to the discovery of 13 compact galaxies behind 
the Fornax cluster (mostly CNELG-like o bjects but also includinj 
four redder, weaker emission line galaxies JPrinkwater et alll99' 
as well as the UCDs in the cluster itself mentioned above. 

Unfortunately, these previous studies cannot be used to re- 
liably and quantitatively predict the number of compact galaxies 
misclassified as stars by the MGC because they either lack SB 
measurements or were performed in a special environment. On the 
other hand, many studies concerning the known compact galaxy 
populations contain very detailed SB information but they do not 
use complete samples, and hence cannot be used to make precise 
predictions either. 

The second dataset consists of all available spectroscopy of 
stellar objects in the range 16 ^ Bmgc < 20 mag in the full MGC 
survey region. We have augmented existing public data (almost ex- 
clusively of QSO candidates) with observations of additional stellar 
objects. Although these data are incomplete they provide a lower 
limit on the frequency of misclassified compact galaxies and help 
in the investigation of their properties. 

The paper is organised as follows: we describe our data in 
Section |2| study the properties of compact galaxies masquerading 
as stars in Section|3|and quantify the MGC's incompleteness due 
to misclassification in Section|4| In Section|5|we evaluate the con- 
tribution of compact galaxies to the local galaxy luminosity func- 
tion and density. Finally, we summarise our findings in Section|6| 
We use Ho = 100 /i km s"^ Mpc~\ ^m = 0.3 and Q.k = 0.7 
throughout. 



The Millennium Galaxy Catalogue (MGC; iLiske et alj|2003h is 
a deep (/.Il,„ = 26 mag arcsec^"^, Bl,m — 24 mag), wide-field 
(37.5 deg^) _B-band imaging survey conducted with the Wide Field 
Camera on the 2.5-m Isaac Newton Telescope. The survey region is 
a 72-deg long, 35-arcmin wide strip along the equator, and is fully 
contained within the survey regions of the Sloan Digital Sky Sur- 
vey (SDSS; York et al. 2000) and the 2dF Galaxy Redshift Survey 
(2dFGRS; CoUess et al-200U . 

Details of the observations, reducti on, object detection using 
SEXTRACTOR ( Bertin & Amouts U99b) and catalogue construc- 
tion are given by Liske et al. (2003). As it pertains to the present 
investigation we briefly remind the reader of our star/galaxy sepa- 
ration procedure for objects with 16 ^ Bmgc < 20 mag. 

As a starting point for classification we used the stellaricity 
parameter provided by SEXTRACTOR, which is produced for each 
object by an artificial neural network. Its input consists of nine ob- 
ject parameters (eight isophotal areas and the peak intensity) and 
the seeing. The output consists of a single number, called stellar- 
icity, which takes a value of 1 for unresolved objects, for ex- 
tended objects and intermediate values in more dubious cases. We 
began by classifying all objects with stellaricity ^ 0.98 as stel- 
lar. This is a 'natural' value to adopt because the stel laricity dis- 
tribution rises sharply from 0.97 to 0.98 (see fig. 9 of iLiske et alJ 
2003). All remaining objects were then inspected visually and clas- 
sified as stars, galaxies, asteroids (defined as apparently real objects 
without any detectable counterparts in SuperCOSMOS Sky Survey 
or SDSS images), or detections due to cosmic rays, satellite trails, 
CCD defects, etc. 

If an object was found to be incorrectly deblended or if 
the object's parameters were obviously wrong, the object was re- 
extracted by manually changing the SEXTRACTOR extraction pa- 
rameters until a satisfactory result was achieved. In addition, all 
low-quality regions in the survey (e.g. near CCD edges or defects) 
were carefully masked out, resulting in an effective survey area of 
30.88 deg^ 



2.2 Spectroscopy 

To obtain spectroscopy for the MGC (Bmgc < 20 mag) we first 
turned to publicly available data and matched the MGC to the 
SDSS Data Release 1 (SDSS -DRI; lAbazaiianet alj|2003li) . 2dF- 
GRS, 2QZ JCroom et all2004) and other smaller surveys. We then 
conducted our own redshift survey (MGCz), mainly using the Two 
Degree Field (2dF) facility on the Anglo- Australian Telescope (see 
iDriver e t al. 2005 for details). While the primary goal of this cam- 
paign was to obtain redshifts for those MGC galaxies without pub- 
lic data, we also targeted two other samples: 



2.2.1 The all-object sub-region 

The first of these additional samples consisted of all objects, irre- 
spective of morphology, image classification or colour, in a small 
sub-region of the MGC. This all-object sub-region is defined as that 
part of the MGC strip which is bounded by ll''49"52:44 < a < 
12''00™08':45 (J2000), i.e. MGC fields 56-61 (with only partial 
coverage of the end fields). The seeing in these fields ranged from 
1.05 to 1.25 arcsec with the median at 1.15 arcsec, slightly better 
than the survey median of 1.27 arcsec. The effective area of this 
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Figure 1. MGC postage stamps of the three objects in the all-object region 
without a good quality (Qz ^3) spectrum. The labels give the object ID 
(upper right), -Bmgc (upper left) and stellaricity (lower left) for each object. 
The image sizes are 33 X 33 arcsec^ . 



region, after subtraction of the exclusion regions, is 1.1371 deg 
and it contains 1552 objects (and 10 asteroids) in the range 16 ^ 
Bmgc < 20 mag. The bright limit is set by the magnitude where 
stars begin to flood and the faint limit is that of the main MGCz 
galaxy survey. 

We have at least one spectrum for each object in this sample 
and we give details of the spectroscopic identifications in TableQ 
However, for three objects the quality of the data is too low to allow 
an id entification or redshift measurement (Q^ = 2. see lPriver et alj 
l2005i) . Hence the overall spectroscopic completeness is 99.8 per 
cent. The three objects without good quality spectra are shown in 
Fig.[T] For two of these (MGC27582 and MGC95504) there is no 
possibility of misclassification: they are clearly low SB galaxies 
and we will treat them as confirmed galaxies in the rest of the paper 

In contrast, the third object, MGC25828 (J115340.51 
+001407.9), is unresolved (stellaricity = 0.99) and hence could 
be a misclassified galaxy. Since the number of such objects is small 
(cf. Table0 even a single additional case would make a significant 
difference to our incompleteness estimates. Hence we take a closer 
look at this object in the following. 

We have three independent 2dF spectra for MGC25828, one 
of which was contributed by the 2QZ (who could not identify it 
either), and these are shown in Fig.|2| Even though a high signal- 
to-noise continuum is common to all three spectra (S/N ~ 12) we 
were nevertheless unable to confidently identify this object because 
of the lack of emission and absorption features. A broad 'bump' 
near 4040 A is only evident in the bottom two spectra while the top 
spectrum shows two broad bumps at 5025 and 6185 A. If this vari- 
ation is real, i.e. due to object variability and not some instrumental 
effect, then the object must vary on the timescale of one day (time 
difference between the top two spectra), while the similarity be- 
tween the bottom two spectra (which were taken three years apart) 
must then be coincidental. 

A strong optical continuum with no or only weak broad emis- 
sion lines, and variability on the timescale of days are characteris- 
tics of blazars (e.g. Waaner & Witzel 1995). Indeed, if we interpret 
the broad bumps above as weak emission lines then they could pos- 
sibly be identified as Lya, C IV and C III] a.t z ^ 2.24 (cf. Fig.|2}- 

The possible identification of MGC25828 as a blazar is fur- 
ther supported by two facts: (i) The object has AGN-like colours 
as evidenced by its selection as a target by both the 2QZ and the 
SDSS QSO survey (who have not yet observed it), (ii) Ac cord- 
ing to the Su perCOSMOS S ky Survey JHamblv et all 1200 ih and 
USNO-B I Mo net et all2003h catalogues this object has no signifi- 
cant proper inotion. Hence it is not surprising that MGC25828 sat- 
isfies the Lon dish et al. 1 2002) selection criteria for BL Lac candi- 
dates in the 2QZ (even though it is missing from their published list 
of candidates; Londish, priv. comm.). 
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Figure 2. Three independent 2dF spectra of the possible blazar MGC25828. 
The top two spectra are from MGCz, while the bottom spectrum was ob- 
tained by the 2QZ. None of them are flux-calibrated. Each panel shows the 
object, mean sky and error spectra. The purpose of the sky spectra is to 
indicate the positions of strong sky lines and they have been scaled arbitrar- 
ily. The UT date of observation is also shown in each panel. The vertical 
dashed Unes indicate the positions of prominent broad QSO emission lines 
at z = 2.24 (top panel) and stellar absorption features at z = (bottom 
panel). 



Table 1. Spectroscopic identifications of the complete sample of objects in 
the all-object region (16 ^ -Bmgc < 20 mag). 



Description 



Number 



MCs" 


3 


Stars 


1083 


QSOs 


21 


Unidentified (blazar or WD) 


1 



Total 



1552 



"Spectroscopically identified galaxies 
which had been morphologically misclassi- 
fied as stars. 



On the other hand we do not observe any broad-band photo- 
metric variability. Comparing the object's brightness in the SDSS 
and MGC images, which were taken one year ap art, we find a dif- 
ference of only 0.07 mag (see lCross et alj|2004l for the filter con- 
version). Also, the object does not appear to be a strong radio or 
X-ray source since it is not listed in either the FIRST or ROSAT 
All-Sky Survey (RASS) catalogues. 

The lack of stellar absorption features in the spectrum of 
MGC25828 (cf. bottom panel of Fig.|2j means that this object could 
also be a DC white dwarf (e.g. Wese mael et al. 19 93). However, it 
also implies that the observed continuum is unlikely to be due to 
a stellar population and so we can discard the possibility that this 
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Table 2. Additional spectroscopic identifications of morphologically stellar 
objects outside of the all-object region (16 SC Bmgc < 20 mag). 



12 3 

u - g (mag) 

Figure 3. Colour-colour plot using SDSS-DRl PSF magnitudes showing 
our colour selection of targets for additional observations of morphologi- 
cally stellar objects outside of the all-object region. Black points and blue 
crosses show MGC stars and QSOs with 18 ^ Bmgc < 20 mag respec- 
tively. We define the band between the solid lines as the 'stellar locus' in 
this colour-colour space. Green dots show a random sample of 300 galaxies 
in the same magnitude range. Note that PSF magnitudes are physically not 
very meaningful for extended objects; we only wish to indicate the degree 
of separation between stars and galaxies in this space. Red dots show all 65 
spectroscopically identified galaxies in the MGC which had been morpho- 
logically misclassified as stars (MCs). 



object is a galaxy. We are therefore reasonably certain to have iden- 
tified all galaxies in the all-object region (within our magnitude 
limits). 



2.2.2 Additional spectroscopy of stellar objects 

Apart from the main galaxy survey and the all-object region, we 
also observed a third sample of targets when 2dF fibres were avail- 
able. The target list for this sample comprised: (i) all morphologi- 
cally stellar objects with 18 ^ -Bmgc < 20 mag that lie away from 
the stellar locus in {u — g)~(g — z) colour-colour space, where the 
'stellar locus' is defined in Fig.|3l (ii) a random subset of stellar 
objects that lie in the stellar locus (18 ^ -Bmgc < 20 mag); (iii) 
stellar objects morphologically classified as galaxies by the SDSS- 
DRl (16 ^ -Bmgc < 20 mag). Objects were only targeted if they 
had not been observed previously by other surveys. Merging the 
incomplete observations of this target list with publicly available 
data yields 3223 secure identifications of morphologically stellar 
objects outside of the all-object region, 65 per cent of which were 
supplied by MGCz. Details are given in Table|2| The relatively high 
discovery rate of misclassified galaxies compared to the previous 
Section is presumably due to the colour selection. However, this 
sample is in no way complete. Since more than one survey has sig- 
nificantly contributed to this sample and since the magnitude lim- 
its, colour selection techniques and incompleteness levels of these 
surveys are different, it is very difficult to reliably construct its se- 
lection function. For example, we expect the spectroscopic com- 
pleteness to the left of the stellar band in Fig.|3|to be very different 
from that within the band because of the MGCz colour selection. 



Description 


Number 


MCs" 

Stars 

QSOs 


62 

2625 

536 


Total 


3223 



""As in TableQ 



However, even the space to the left will not be sampled homoge- 
neously because of the colour selection of the 2QZ and SDSS QSO 
surveys. Simply ignoring these additional data does not help either 
because we specifically excluded objects already observed by these 
surveys from MGCz observations. 

In Section l43l we will ignore these complications and apply 
a rudimentary incompleteness correction to this sample based on 
the selection function of the MGCz observations. In any case, at 
the very least this sample provides an indication of the properties 
of misclassified compact galaxies as well as a lower limit on their 
incidence. 



3 PROPERTIES OF MISCLASSIFIED COMPACT 
GALAXIES 

In total we have spectroscopically identified 65 compact galaxies 
(cf. Tables Q and |2j that had been morphologically misclassified 
as stars from the imaging data (using the procedure described in 
Section im . In the following we will refer to these objects as mis- 
classified compact galaxies (MCs). We illustrate their morphologi- 
cal similarity to stars in Fig.|4] where we show a selection of their 
MGC images alongside the images of two stars. In Fig.|5|we com- 
pare their stellaricity values to the stellaricity distributions of stellar 
objects and non-misclassified galaxies. 

In this Section we will investigate the properties of these 65 
MCs in order to understand what types of galaxies are being missed 
due to misclassification. However, since these 65 MCs make up 
only about one half of the total number of MCs expected in the 
MGC (see Section|4} it is in principle possible that their properties 
are not fully representative of all MCs in general. Nevertheless, 
they provide a good indication of the types of objects that may be 
missed. 



3.1 Spectral properties 

Only five of the 65 MCs are absorption line galaxies and we show 
three example spectra in Fig.|6ja)-(c). Three of the absorption line 
MCs (M GC 1 0809, MGC 1 74 13 a nd MGC34647) appear to be E-l-A 
galaxies I Dressier & Gunn 1983) since they have strong Balmer ab- 
sorption with an H5 rest equivalent width (EW) of > 5 A superim- 
posed on an elliptical galaxy spectrum. Although MGC 10809 pos- 
sibly has a weak [O II] A3727 emission line (cf. Fig.jSJi), its EW is 
< 2.5 A and hence all three objects sa tisfy commonly used criteria 
to define E-l-A galaxies (e.g.lGoto"2005). 

Note also the unusual spectrum of MGC26040 (Fig. IS};). It has 
a very weak 4000 A break {Dm)oo = 1-28) and a blue, AGN-like 
(u — g) colour (cf. Section lT2l . It is also a radio and X-ray source. 
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Figure 4. Right: MGC postage stamps of a sample of ten misclassified compact galaxies (MCs). Left: Images of two stars for comparison. Labels and image 
sizes are as in Fig. HI 
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Stellaricity 

Figure 5. Stellaricity distributions of spectroscopically confirmed stars and 
QSOs (open red histogram), galaxies (hashed) and misclassified compact 
galaxies (MCs, solid blue). 



with detections in FIRST, NVSS and RASS. Based on these prop- 
erties _Schwope et al. (2000) and Anders on et al. 1 2003) indepen- 
dently classified MGC26040 as a probable BL Lac. This object also 
errone ously appears in the SDSS QSO catalogue or Schneider et alj 
i2003h . Erroneously because Schneider et al. required a spectrum 
to show at least one broad emission line for the object to be in- 
cluded in the catalogue, and stressed that BL Lac objects were not 
included. Indeed, the object has been excised from the mos t recent 
version of the SDSS QSO catalogue iSchneider et all2005h . 

The spectra of 60 of our 65 MCs show nebular emission lines. 
The EUV photons that ionize the emitting gas may be provided 
by either hot massive stars from recent or ongoing star forma- 
tion or by the non-thermal continuum of an active galactic nucleus 
(AGN) or both. In principle, the emission line spectrum is quite 
sensitive to the hardness of the ionizing radiation and hence emis- 
sion line ratio diagrams are frequently used to distinguish between 
star-forming or H II region-like galaxies and narrow-lined (i.e. type 
2) AGN, such as Seyfert 2 galaxies and low-ionization nuclear 
emission regions (LINERs). These line ratio diagnostics were pio- 
ne ered byiBaldwin, Phillips Sl Terlevich 1 1981 ) and further refined 
bv lVeilleux & Osterb rock i \9%%. 

To construct an [O IIl]/H/3 versus [N II]/Hq line ratio diagram 



we have measured the line fluxes of the 60 emission line MCs by 
first subtracting a locally fitted continuum and then fitting one or 
more gaussians as required. The HQf-[Nll] complex was fitted with 
at least three components where the [Nil] AA6548,6584 doublet 
was constrained to have the same width and redshift and a flux ratio 
of 1:3. For two galaxies we were able to improve the fit by adding a 
weak broad component in Ha and H/3 while five galaxies required 
a weak broad component in Ha only, indicative of Seyfert L8 and 
1.9 galaxies respectively. Hence we classify these seven objects as 
AGN and collectively refer to them as type 1.x AGN. Two example 
spectra are shown in Fig.|6{d)-(e). 

In Fig.|7|we show the resulting line ratios for 44 of the 60 
emission line MCs. The other 16 MCs are excluded from this plot 
because either their spectra do not cover Hq (11 cases) or at least 
one of the four lines could not be detected with reasonable S/N (4 
cases). Finally, one object was excluded because both the Ha and 
[O III] A5007 lines appeared to be saturated. 

The dashed line in Fig. |7| is the empirical upper enve lope 
of star-forming galaxies found by Kauff mann et alj (2003a) and 
the solid line shows the theoretical upper limit for t he location of 
pure s tarbursting galaxies in this diagram derived bv lKewlev et alJ 
teOOrt- A number of galaxies lie between these lines and it is clear 
that it is not possible to perfectly separate star-forming galaxies 
from type 2 AGN. The reason is of course that nuclear activity and 
star-formation are not mutually exclusive phenomena and an ob- 
ject's emission lines may contain contributions from both. It is well 
known that star-formation may affect the line ratios not only of the 
lower luminosity LINERs (e.g. Ho et al. 1993) but also of the more 
powerful Seyfert 2s (e.g. Cid Fernandes et al. 2001). 

Hence we follow Brinchmann e t ah t2004 ) and divide our 
MCs into three groups: we label objects below the dashed line in 
Fig.Qas star-forming galaxies (24 objects, green dots), those above 
the solid line as type 2 AGN (4 objects, red dots) and those in- 
between as composites (9 objects, orange dots). In Fig.|S(f)-(k) we 
show example type 2, composite and star-forming spectra. 

We caution the reader that the above terminology is somewhat 
misleading because it suggests that all star-formation/ AGN over- 
lap is limited to the composite objects. That is not the case: we 
plot in Fig.|7|as red triangles the narrow components of the seven 
galaxies already identified as AGN by the presence of broad com- 
ponents. One of these objects lies below the Kauffman n et al. line, 
demonstrating that even objects labell ed as 'star-form ing' above 
may in fact contain an AGN (see also ISimpsoril2005h . Similarly, 
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Figure 6. Rest-frame spectra of ten MCs representative of the various spec- 
ial classiiications. We sliow tlie object, error and mean sky spectra (wliere 
available) which have been scaled arbitrarily. Only SDSS spectra are flux 
calibrated. The vertical dashed lines mark the locations of common nebu- 
lar emission and stellar absorption features, (a) Absorption line galaxy, (b) 
Example E-^A galaxy, (c) BL Lac. (d)-(e) Example MCs requiring a broad 
emission line component at least in Ha. (f)-(k) Example MCs classified as 
type 2 AGN, composite or star-forming galaxies on the basis of Fig. 171 
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Figure 7. Emission line ratio diagnostic diagram used to classify emis- 
sion line MCs. The dashed line shows the observed upper boundary of 
star-forming galaxies i Kauffmann et al. 2003a) while the solid line gives 
the theoretical upper limit for starbursts ( Kewlev et al. 2001). Accordingly, 
green dots mark star-forming galaxies, red dots denote type 2 AGN and or- 
ange dots show composites. Red triangles show the type 1.x AGN, using 
only their narrow line components. 



a significant fraction of bona fi de Seyfert 2s are known to con tain 
a young stellar p opulation (e.g. Storc hi-Bergman n et al. 2000) and 
iBrinchmann et al. 1 2004) estimated that nearly 5 per cent of the to- 
tal star formation rate density at 2 ~ 0.1 occurs in the host galaxies 
of type 2 AGN. 

In addition, two further caveats apply to our classifications 
based on our emission line measurements: (i) Our 2dF spectra 
(from 2dFGRS and MGCz) are not flux calibrated. However, since 
the members of each line pair are quite close in wavelength we do 
not expect the flux calibration to have a large impact on the line 
ratios, (ii) We do not account for the underlying stellar Balmer ab- 
sorption. Neglecting this absorption pushes galaxies in Fig. Q to- 
ward the AGN regime. To assess the severity of t his problem we 
compa re our line ratio measurements to those of iTremonti et alJ 
y004J) who carefully modelled and subtracted the stellar contin- 
uum (including the Balmer absorption) b efore emission li ne fitting. 
Eight of our objects in Fig.Qwere also in lTremonti et alj s sample, 
but six of the 16 line ratio measurements cannot be compared to 
our s because of our use of a broad component. For the rest we find 
that lTremonti et alj s ratios are smaller than ours by less than 0.05 
dex in five cases, by 0.07 dex in one case, and by ~0.2 dex in four 
cases. From Fig. we can see that a shift of the latter magnitude 
would change the classification of three of the type 2 objects and of 
most of the composites. 

Of the 11 MCs whose spectra do not cover Ha we find that six 
show evidence of a broad component in H/3 and we add these six 
objects to the type 1.x AGN class. One further object (MGC21061) 
has an [O IIl]/H/3 ratio of 1.4, placing it firmly in the AGN regime 
of Fig. Q irrespective of its unknown [Nll]/Ha ratio. In fact this 
object shows broad Mg II emission and so we also classify it as a 
type 1.x AGN. 

This leaves a total of nine emission line objects which we are 
unable to classify because at least one pair of emission lines is un- 
available for various reasons. However, two of these objects appear 
to be almost exact superpositions of a small, faint galaxy and a 
foreground star, since they both show clear evidence of stellar ab- 
sorption lines at z = in addition to redshifted emission lines. 
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Table 3. Properties of all known 65 MCs in the MGC. 



ID 



MS 



Rt («-5)S (5-^)S logl^f] logmU" 



Class": 



Comments-' 



Absorption line objects: 



JC48184 J133616. 13+000726.3 


0.1461 


-19.04 


1.24 


1.81 


1.27 


JC10809 J104329. 26+001205.1 


0.1326 


-20.18 


1.07 


1.72 


0.95 


JC17413 J111146. 63+000017.9 


0.1104 


-19.34 


0.99 


1.66 


1.05 


JC34647 J123243. 45-000453.8 


0.1875 


-19.98 


0.75 


1.77 


0.70 


JC26040 J115404.55-001009.8 


0.2535 


-20.76 


1.05 


0.35 


1.03 


lission line objects: 












JC00704 J100140.13-000123.2 


0.3497 


-20.72 


1.93 


0.49 


0.96 


JC03855 J101326.04-000136.4 


0.2555 


-20.13 


1.84 


1.00 


1.74 


JC16803 JlllOll. 90+000352. 7 


0.2177 


-19.37 


1.63 


0.29 


1.26 


JC20403 J112525.59-000301.5 


0.3596 


-20.83 


1.87 


0.28 


0.95 


JC21061 J112753.52+000518.0 


0.4596 


-21.75 


2.96 


0.36 


1.00 


JC29999 J121040.15-000315.5 


0.3016 


-20.17 


2.63 


0.36 


1.40 


JC32112 J122102.95-000733.3 


0.3662 


-21.46 


1.55 


0.62 


0.94 


JC33386 J122804.66-000506.7 


0.3190 


-20.78 


— 


0.14 


0.83 


JC35898 J123840.24-000744.0 


0.2046 


-19.64 


1.87 


0.45 


0.34 


JC41402 J130611. 20-000932. 7 


0.3925 


-20.92 


2.40 


0.38 


1.35 


JC55396 J140434.62-000800.1 


0.3034 


-20.30 


2.55 


0.42 


0.99 


JC57877 J141207.92+000325.3 


0.2344 


-20.05 


1.39 


0.47 


0.82 


JC61874 J142631. 76-000249.8 


0.2928 


-20.19 


2.22 


0.14 


1.04 


JC67450 J144413. 26+001118.4 


0.2067 


-20.11 


2.45 


0.47 


1.12 


JC27848 J120118. 84-000654.6 


0.1818 


-19.86 


1.36 


1.03 


0.61 


JC28852 J120450. 47-001159. 5 


0.1196 


-18.46 


0.92 


0.74 


0.72 


JC32111 J122217.54-000744.1 


0.1714 


-20.26 


2.50 


1.79 


1.16 


JC63901 J143053. 30-000917.5 


0.1231 


-19.89 


1.77 


0.87 


0.64 


JC02455 J100702. 72-000937.2 


0.0666 


-17.34 


0.93 


1.07 


0.90 


JC20976 J112754. 40-001341. 7 


0.0491 


-16.28 


0.42 


1.03 


0.71 


JC21348 J113005. 55-001001. 7 


0.0684 


-17.07 


0.68 


0.93 


0.88 


JC22397 J113626. 84+000456. 7 


0.1338 


-18.66 


0.86 


0.90 


0.52 


JC27012 J115741. 37-001332. 7 


0.0851 


-17.76 


0.84 


1.16 


0.87 


JC34229 J123121. 18-000647.2 


0.0793 


-17.10 


0.73 


1.14 


0.65 


JC54685 J140108. 42-000030.4 


0.1117 


-18.13 


0.93 


1.06 


0.76 


JC56265 J140726. 91+001154.1 


0.1756 


-19.92 


1.60 


1.20 


0.74 


JC96267 J111523. 85+000622. 5 


0.0277 


-14.65 


0.46 


0.64 


0.81 


JC04637 J101532. 73-001337.6 


0.0943 


-18.10 


1.40 


1.30 


1.03 


JC12192 J104947.47+001423.1 


0.1215 


-18.56 


0.99 


1.21 


0.93 


JC14297 J105917.28+001232.8 


0.1993 


-20.24 


2.32 


1.22 


0.63 


JC15183 J110403. 13+000644.0 


0.0760 


-17.74 


0.89 


0.80 


0.51 


JC16641 J111014. 20-000022.9 


0.0919 


-17.89 


0.81 


0.63 


0.50 


JC22174 J113423. 54-001444.1 


0.0770 


-17.28 


0.61 


0.82 


0.78 


JC23590 J114130. 16-000433. 5 


0.1107 


-17.90 


1.27 


0.82 


0.56 


JC26871 J115821. 52-000520.6 


0.0799 


-17.51 


0.77 


1.06 


0.77 


JC27394 J120013. 37-000747.7 


0.0614 


-16.56 


0.35 


0.86 


0.50 


JC27446 J120016. 76-001119.0 


0.0807 


-17.81 


0.92 


1.05 


0.75 



0.53 
1.18 
0.77 
0.67 
1.36 

0.14 
0.36 
0.43 
0.45 
0.40 
1.04 

1.10 
0.53 
0.64 
1.14 
0.90 
0.59 
0.33 
0.37 
-0.06 
0.49 
0.36 
0.23 
0.11 
0.37 
-0.34 
-0.20 
-0.15 
0.40 
0.11 
0.39 
0.22 
0.34 
0.60 
0.20 



-0.19 
-0.28 

-0.52 

0.12 

0.05 

-0.31 



-0.32 
-0.36 
-0.42 
-0.53 
-0.80 
-0.78 
-0.53 
-0.58 
-0.38 
-0.47 
-0.47 
-0.44 
-0.38 
-0.52 
-0.58 
-0.41 
-0.50 
-0.72 
-0.67 
-0.82 
-0.65 
-0.61 
-1.01 
-0.54 



E+A 

E+A 

E+A 

BLLac 



tl.xAGN 

tl.xAGN 

tl.xAGN 

tl.xAGN 

tl.xAGN 

tl.xAGN? 

tl.xAGN 

tl.xAGN 

tl.xAGN 

tl.xAGN 

tl.xAGN 

tl.xAGN 

tl.xAGN 

tl.xAGN 

t2AGN 

t2AGN 

t2AGN 

t2AGN 

C 

C 

C 

c 
c 
c 
c 
c 
c 

SF 
SF 
SF 
SF 
SF 
SF 
SF 
SF 
SF 
SF 



DRl: gal 
DRl: gal 

Sw, A: BL Lac; RASS: FIRST; DRl: gal; in all-object region 



be Ha; S; DRl: gal 

be H/3; RASS; FIRST 

be Ha; A: broad-line AGN; RASS; DRl: QSO 

be H/3 

be Mg II; S; FIRST; DRl: gal 

bad sky near H/3; be H/3? 

be Ha,H/3; S; A,W: NLSyl; RASS; FIRST; DRl: gal; 2QZ: NELG 

be H/3; 2QZ: QSO 

be Ha; K: LINER; FIRST; DRl: gal 

be Ha, Mg II; S; DV: variable; DRl: QSO 

be Ha; K: LINER; DRl: gal; 2QZ: NELG 

be H/3 

bad sky near [O III]: be H/3; 2QZ: NELG 

be Ha,H/3 



Ka; FIRST: DRl: gal 
FIRST 



2QZ: NELG 

in all-object region 



DRl: gal 



RASS; in all-object region 






Gtq 

Co 



Table 3 - continued 



00 



ID 


^ 


-DMGC 


Rl ( 


« - 9% 


(9 


--)S 


logi2|i] log 


IN III'* 

Ha 


Class-^ 


Comments-' 


MGC27813 J120141. 90-000502.8 


0.0829 


-17.69 


0.51 


1.07 




0.71 


-0.20 


-0.42 


SF 




MGC31360 J121830. 69-001117.2 


0.0849 


-18.07 


0.68 


0.89 




0.58 


0.15 


-0.67 


SF 


DRl:gal 


MGC31370 J121754. 42-001147.8 


0.0708 


-16.87 


0.44 


0.76 




0.83 


-0.30 


-0.41 


SF 




MGC33857 J123007.02-001123.4 


0.0905 


-17.62 


0.64 


1.19 




1.03 


-0.21 


-0.64 


SF 




MGC35989 J124032. 96-001402. 7 


0.1646 


-19.25 


1.35 


0.54 




0.04 


0.62 


-1.04 


SF 


2QZ: NELG 


MGC47144 J133229. 12+001425. 2 


0.0804 


-17.18 


0.40 


0.94 




0.55 


0.23 


-0.70 


SF 




MGC52270 J135230. 99+000345. 7 


0.0833 


-17.55 


0.67 


1.00 




0.69 


0.30 


-0.79 


SF 


2QZ: NELG 


MGC57241 J140846. 88-001243. 5 


0.0950 


-17.73 


0.91 


0.91 




0.49 


0.46 


-1.11 


SF 




MGC60772 J142229. 09-000812.6 


0.0543 


-16.21 


0.45 


0.84 




0.75 


0.35 


-0.82 


SF 




MGC64370 J143243. 74-000105.8 


0.1303 


-18.48 


0.88 


0.77 




0.35 


0.37 


-0.81 


SF 




MGC65997 J143754. 03-001527.0 


0.1400 


-18.51 


1.24 


0.78 




0.52 


0.06 


-0.72 


SF 




MGC67179 J144048. 09-000858.1 


0.1212 


-18.61 


1.01 


0.57 




0.21 


0.55 


-0.89 


SF 




MGC95276 J105526. 00-000501.8 


0.1081 


-19.03 


1.57 


1.14 




1.09 


0.25 


-0.60 


SF 




MGC96202 J101416. 19+001133.4 


0.0713 


-18.40 


0.87 


1.20 




0.92 


-0.21 


-0.42 


SF 




MGC00145 J095931. 46-000013. 3 


0.2152 


-19.64 


1.33 


1.01 




0.59 


0.16 


— 






MGC34325 J123100. 44+000402.0 


0.4633 


-21.14 


2.61 


0.85 




0.20 


0.36 


— 


tl.xAGN? 


2QZ: NELG 


MGC11178 J104353. 73-001419.0 


0.1005 


-18.41 


1.65 


1.24 




0.91 








weak emission lines 


MGC14790 J110156. 38-000001.8 


0.0733 


-17.03 


0.53 


1.50 




1.20 








weak emission lines 


MGC17184 J111122. 09-000636.9 


0.1231 


-18.36 


1.12 


1.26 




0.82 




-0.46 




weak emission lines 


MGC35480 J123642. 47-000946.6 


0.0722 


-16.97 


0.43 


0.72 




-0.50 








Ha,[0 III] saturated; 2QZ: NELG 


MGC52589 J135243. 21-000017.3 


0.1589 


-19.64 


1.12 


0.91 




0.65 


-0.14 






Ha in telluric line 


MGC20168 J112428. 81+000321. 7 


0.3283 


-20.71 


1.26 


2.19 




0.56 


0.50 


— 


star-l-gal 




MGC28037 J120107.34-001639.4 


0.2056 


-20.50 


1.23 


1.25 




0.14 




— 


star-l-gal 


weakH/3 


"Absolute magnitude using hioo = 1, 


Qm = 0.3, Oa = 


0.7. For the BL Lac and 


type 1.x 


objects we assume 


a power-] 


law spectral shape and hence a ii'-correction of 


K{z) = -(a + l)2.51og{l + z),whe 


re a = —0.5. For all other objects we use the ftT-corrections of Driver et al. 1 2005, section 3.2). 

~^ kpc, where the observed apparent half-light radius, r^, has been seeing-corrected using r^ = 




''Physical half-light radius in units of h 


r2-0.32r2iDriveretalJ|2004 



Co 



'^Rest-frame colours using SDSS-DRl PSF magnitudes for the BL Lac and type 1.x objects, and model magnitudes for all others. A'-cotrections as above. 
''a ' — ' indicates that the Ha-[N II] complex is not covered. 

"^SF = star-forming galaxy; C = star-fonning/A GN composite; star-Hgal = superpos ition of a star and a galaxy. ^^^^_^^^^^ ^_^^ ^^^^_^^^^ ^_^ 

•^bc = broad component; A = Anderson et al. ("2003); DV = d e Vries et aj] IKX)'^ : K = Kniazev et al. '2004'); Ka ='Kauffmann et al." ■2003a'); S = ISchneider et alJ <200.'5t) ; Sw = 
ISchwopee tal.. 1.2000,); W = ,Williani s et al. I .200Z) ; RASS = ROSAr All-Sky Survey Ivoges et al. 1999,2000,); FIRST =,White et al,^1997.). 
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We summarise our line ratio measurements and classifications in 
columns 7-9 of Table |3| Finally we note that several of the MCs 
classified as AGN have been previously classified by other authors 
or have been detected in X-ray or radio surveys (see the last col- 
umn of Table|3j- These findings broadly support our AGN identifi- 
cations. 



3.2 Colours 

In Fig. [8| we compare the MCs to the general MGC galaxy and 
QSO populations in the rest-frame {u — g)o-{g — z)o colour-colour 
plane. In this comparison we use SDSS-DRl PSF magnitudes for 
QSOs and for those MCs classified as type 1.x AGN or BL Lac, 
while we use model magnitudes for all other objects. For galaxies 
we als o use the individual A'-corrections derived by IPriver et alj 
^00^. However, these will be inappropriate for those MCs whose 
broad-band fluxes are likely to be strongly contaminated by an 
AGN continuum (the type 1.x and BL Lac objects). For these ob- 
jects and for QSOs in general we use a crude /^-correction assum- 
ing a power-law spectral shape with index a — —0.5 JFrancig 
Il993l) . This correction does not account for the effects of broad 
emission lines which can contribute substantially to the flux in a 
given band. In fact, when considering colours this procedure is of 
course equivalent to applying no A-correction at all. The MCs' 
rest-frame colours are listed in columns 5 and 6 of Table|3| 

The galaxy distribution in Fig. |8| (small grey points) shows 
the well-known division bet ween a red (~ early -type) and a blue 
(« late-type) population (e.e. lBaldrv et alJl20o4 . while the QSOs 
(crosses) exhibit their characteristic UV excess. We note that low-z 
QSOs (red crosses) are somewhat redder in {g — z)o than the bulk 
of QSOs at higher redshifts (cyan crosses) due to Hq falling in the 
z-b and and, pr esumably, contamination from the host galaxies (see 
also lRichardTe t al. 2001). 

The redshift range of the low-z QSOs was chosen to coincide 
with that of the type 1.x AGN (i.e. 0.2 < z < 0.46) and indeed 
we find that their colours are matched by the BL Lac (blue square) 
and by 12 out of 14 of our type 1.x AGN (red triangles). This indi- 
cates that the broad-band UV- visual fluxes of these MCs are indeed 
dominated by an AGN continuum and not by starlight, supporting 
our identification of these objects as AGN based on their emission 
lines. 

One type 1.x (MGC35898) is somewhat bluer in {g — z)q than 
the rest, which is probably due to vigorous star-formation activity 
which is indicated by strong, narrow high order Balmer line emis- 
sion (cf. Fig.le];). In contrast, the 14th type 1.x (MGC03855) is 
significantly redder in both colours. However, it is both an X-ray 
and radio source and so its identification as an AGN is reasonably 
secure. This object may well be reddened by dust in the host galaxy 



b i( 

JRichards et all2003h 



In contrast to type 1.x objects, the AGN contribu- 
tion to the cont i nua of type 2 objects is extremely small 
KaufftnanneLalJ |2003a|), even in th eir n uclear regions 
Schmitt. Storchi-Bergmarm & Fernandea ll999B . and hence 
their broad-band colours are expected to be dominated by their 
host galaxies. Indeed, the colours of our four type 2 objects (red 
dots) are quite different from those of the QSOs but are consistent 
with normal galaxies. 

In a broad-brush sense the properties of the host galaxies of 
type 2 AGN are known to depend on the AGN's [O III] A5007 lu- 
minosity ( Kauffma nn et al. 2003a ). For example, the mean stellar 
ages (as measured by D4000 and the US EW) of the hosts of low- 
luminosity AGN are similar to those of normal early-type galax- 
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Figure 8. Rest-frame colour-colour plot of all 65 MCs and the general 
MGC galaxy and QSO populations as indicated. Abbreviations are those 
of Table I3I 'abs.l.' and 'em.l.' refer to absolution and emission line objects 
without further classification, 'low-z QSOs' refers to QSOs in the same 
redshift range as type 1.x AGN, i.e. 0.2 < z < 0.46. Star/galaxy super- 
positions are shown with and without ^'-corrections (open and solid brown 
squai'es respectively). We use SDSS-DRl PSF magnitudes for QSOs, type 
1.x AGN and the BL Lac, and model magnitudes for all other objects. The 
dashed lines show the (observed-frame) 'stellar locus' defined in Fig.lsl 



ies, while high-luminosity AGN reside in hosts with younger stel- 
lar populations, similar to those of normal late-type galaxies. This 
would explain why we observe both red and blue type 2 objects. To 
check whether our objects are consistent with the above trend we 
need to measure their [O III] luminosities. Unfortunately, only the 
red object's spectrum (MGC321 1 1) is flux calibrated: it turns out to 
be of intermediate luminosity (I/|o„,] = lO^^^^L©) which is how- 
ever low enough for the D4000 and H(5 EW distributions of type 
2s (cf. fig. 12 ofJKauffmann et al.) to still overlap comfortably with 
the ranges typical for normal early-type galaxies (D4000 > 1-7 
and H(5 EW < 1 A). The strength of our object's 4000 A break 
(-D4000 = 1.7) does indeed fall in this overlap region and indicates 
the presence of an old stellar population, but its H5 EW {— 1.9 A) 
is somewhat to o high for normal early-types. In fact, MGC321 1 1 is 
displaced from lKauffmann et alj s J2003hl) well-defined D4ooo-H(5 
relation. This occurs frequently for type 2s and may be interpreted 
as a sign of a significant burst of star-formation in the past two Gyr 
JKauffmann et all2003 J) . 

An old, evolved stellar population together with a signifi- 
cant, truncated t wo-Gyr-old starburst has also been invoked by 
iBalogh et alJi2005.) to explain the (r — K) and (u — g) colours of 
E-l-A galaxies, and so it is perhaps not surprising that MGC321I1 
and our three E-l-As (blue dots) are found in a similar region of 
Fig-in In this scenario a galaxy in the red locus in Fig.|8|is moved 
down and to the left onto the blue sequence by a burst of star- 
formation. After the burst's truncation the galaxy's UV-blue colour 
reddens again quite rapidly while redder colours evolve much more 
slowly, leaving the galaxy below the red locus in Fig. |8| We note 
th at our E-l-As' (u — q ) colours are indeed identical to those found 
bv lBalogheTai]J200^ . 

The only MC that is a normal absorption line galaxy (blue 
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triangle, cf. Fig.|6t) lies among the general red galaxy population 
as expected. 

The MCs identified as star-forming (green dots) or composite 
(orange dots) in the previous Section occupy the blue half of the 
general blue galaxy sequence. The star-forming MCs extend to its 
very blue tip. The three objects furthest along the sequence all show 
higher order Balmer lines (e.g. MGC35989, see Fig.|Slc), indicative 
of strong star-formation activity. 

Notice the odd locations in Fig.|8|of the two likely star/galaxy 
superpositions (open brown squares). These are not too surprising: 
if the broad-band fluxes of these objects are dominated by the stars 
rather than the galaxies then their iT-corrections are meaningless. 
Indeed, the objects' observed-frame colours (solid brown squares) 
place them squarely in the centre of the stellar locus (dashed lines, 
cf. also Fig.|3}. 

The unclassified emission line objects (brown dots) mostly lie 
along the general blue sequence. The reddest three objects are also 
those with the weakest emission lines while the object near the tip 
of the blue sequence (MGC34325) again shows high order Balmer 
lines. However, we have re-plotted Fig. [8| using PSF magnitudes 
and QSO A'-corrections for the unclassified objects to see whether 
any would lie among the low-z QSOs. This was indeed the case 
for MGC34325 and so this object could also be a type 1.x AGN. 
Finally, the object at (g — z)o — —0.5 is the one with saturated 
Ha and [O III] A5007. It also shows very strong high order Balmer 
lines and may hence be undergoing an extreme starbursting phase. 

3.3 Redshifts, luminosities and sizes 

In Fig. 121 we show the distribution of the MCs in the absolute 
magnitude-redshift plane, along with the rest of the MGC galaxy 
population. To calculate the absolute magnitudes we use the same 
Tf-corrections as in the previous Section and we list the MCs' red- 
shifts and magnitudes in columns 2 and 3 of Table|3| 

The MCs appear to be segregated in the Mb-z plane accord- 
ing to their type: star-forming and composite objects mostly lie 
along the faint magnitude limit at low redshifts, while the type 1.x 
AGN all lie at 2: > 0.2 and correspondingly brighter magnitudes. In 
fact, the only other MCs at 2; > 0.2 are the BL Lac, the star/galaxy 
superpositions and two unidentified emission line objects. One of 
these lies a.1 z ^ 0.2 and the other is MGC34325 (at z = 0.46) 
which was already suspected of being a type 1.x in the previous 
Section. Assuming that MGC34325 is indeed a type 1.x, we find 
that all MCs whose continua are not dominated by starlight lie at 
z > 0.2 and Mb < —19.3 mag, while the 'real' galaxies among 
the MCs all lie a.t z < 0.22 and Mb > -20.3 mag, so that there is 
almost no overlap between the two groups. In particular, this pro- 
duces a pronounced bimodality in the MCs' magnitude distribution. 
From now on we will generically refer to these two groups as 'real' 
and 'AGN-dominated' MCs (47 and 18 objects respectively, where 
the latter group also contains the star/galaxy superpositions). 

So why are 'real' and 'AGN-dominated' MCs distributed so 
differently? And why are both groups distributed differently from 
the general MGC galaxy population? 

For the 'AGN-dominated' MCs two obvious explanations 
come to mind, (i) It is well-known that the fre quency of nuclear ac - 
tivity decreases with decreasing redshift (e.g. ICroom et alj|2004b . 
This is also seen in the MGC where the density of QSOs falls 
off rapidly towards lower z. (ii) The frequency of nuclear activity 
might also decrease with luminosity or size. In this case the hosts of 
the low-z counterparts of our type 1 .x would probably be resolved 
and so they would not be selected as MCs. To test this explana- 
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Figure 9. Absolute magnitude vs. redshift for all 65 MCs and the general 
MGC galaxy and QSO populations. Symbols are as in Fig.|8] The solid lines 
show the bright and faint apparent magnitude limits of the MGC (using 
the maximum and minimum i^T-con'ection respectively) while the dashed 
line marks the approximate bright magnitude limit of MCs (= 18 mag, 
cf. Section l2.2.2l . The green and red lines show the limits of the regions 
occupied by the 'real' and 'AGN-dominated' MCs respectively. The side 
panels show the projected normalised distributions for the MCs (solid lines) 
and for the general MGC galaxy population (grey shaded histograms). 



tion we would have to fit the emission lines of all resolved MGC 
galaxies. However, from Fig.|8|it is already clear that only very few 
resolved galaxies with QSO-like colours exist. 

To understand the distribution of the 'real' MCs it is helpful to 
consider their magnitude and size selection limits simultaneously. 
In Fig. llOl we show the 'real' MCs along with the rest of the MGC in 
the physical size-absolute magnitude plane. To be able to compare 
the sizes with each other, especially in the case of small galaxies, 
we have applied the seeing correction of Driver et al. ( 2005). The 
MCs' sizes are listed in column 4 of Table|3| 

First of all we notice the luminosity-size relation of galaxies: 
fainter galaxies tend to be smaller than brighter ones. This already 
explains why in Fig. |9|the 'real' MCs are mostly found near the 
faint magnitude limit: at a given redshift, the smallest galaxies are 
also the faintest. However, this should be true at all redshifts and so 
this relation by itself does not explain why 'real' MCs are prefer- 
entially found at lower z. 

The polygon in Fig. HOI encloses the observable parameter 
space for galaxies al z = 0.08. It is determined by the MGC's 
apparent magnitude, size and low SB limits. These selection limits 
will be discussed further in Section |4| (see Fig. II II and accompa- 
nying text). The minimum size limit, rmin, is of particular interest 
here because MCs are expected to have apparent sizes r^ < rmin- 
The three curved diagonal lines in Fig. 1101 show how the top part 
of the polygon moves as a function of redshift. Essentially, they 
show the locations of galaxies with r^ = rmin and Bmgc = 13, 18 
and 20 mag respectively (top left to bottom right). We shall refer 
to the last of these as the 'small faint line' (SFL). Evidently, galax- 
ies below the SFL cannot possibly have sizes smaller than rmin, no 
matter what their redshifts are. 

The point of this plot is that the slope of the SFL is different 
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Figure 10. Seeing-corrected physical size vs. absolute magnitude for 'real' 
MCs and the general MGC galaxy population as indicated. The large, light 
and dark grey dots refer only to galaxies in the range 18 < Bmgc < 
20 mag. The polygon shows the various selection limits at 2 = 0.08: the 
MGC's apparent magnitude limits, the minimum and maximum size limits 
as well as the low SB limit. The vertical dashed line shows the bright limit 
of the MCs as in Fig. |9] The diagonal lines show how the top part of the 
polygon moves as a function of z. The exact locations of the polygon and 
the lines depend on the assumed X-correction and seeing. Here we have 
used median values for both. 



from the slope of the luminosity-size relation of the galaxies. The 
SFL actually curves away from the galaxy locus at brighter mag- 
nitudes and hence does not equally cut into the galaxy population 
at all luminosities. In fact, at the highest luminosities there is an 
almost empty region between the galaxy locus and the SFL and we 
must conclude that galaxies are very rare in this part of parameter 
space. To make this point clearer we highlight in Fig. llOl aalaxies at 
z ~ 0.08 and 2; ~ 0.18 by large light and dark grey points respec- 
tively. For clarity we only show objects with 18 < Bmgc < 20 mag. 
The low-2 population reaches all the way to rmin and so it is not 
surprising to find galaxies smaller than rmin (and hence 'real' MCs) 
at 2 ~ 0.08. In contrast, the high-z population does not quite reach 
to the SFL and hence cannot reach to rmin, implying that galaxies 
do not exist beyond this limit either (unless the size distribution is 
bimodal). 

Put simply, the distribution of the MCs in Figs.l9land llQl is 
explained by the fact that the MGC's minimum size limit only af- 
fects faint galaxies and that the MGC is already complete at high 
luminosities. 



3.4 Implications 

In the previous Sections we have seen that the spectra, colours, 
magnitudes and redshifts of 18 of our 65 MCs indicate that their 
broad-band fluxes are not dominated by starlight. In the context 
of deriving a luminosity function these objects must be considered 
a contamination. In addition, having found them among the MCs 
raises the possibility that the non-misclassified compact galaxies 
are similarly contaminated. If erroneously included in the analysis 



these 'AGN-dominated' galaxies could significantly bias our LF es- 
timates. 

We have therefore subjected all non-MC galaxies with r^ < 
rmin -f 0.1 arcsec to an analysis of their emission lines and colours 
along the lines of Sections l3. Il and l3.2l There are 192 such galaxies 
but we have found only two type 1.x objects among them. Hence 
the high fraction of 'AGN-dominated' objects among the MCs must 
be due to the colour selection of morphologically stellar targets. In 
any case, from now on we will exclude all 20 'AGN-dominated' 
compact galaxies from all further analysis. 



4 MGC INCOMPLETENESS AT HIGH SB 

In order to quantify the MGC's incompleteness due to the misclas- 
sification of compact galaxies as stars we will now restrict our- 
selves to the complete dataset from the 1.14 deg^ all-object region 
described in Section 12.2.1 1 446 of the 1552 objects in this region 
are confirmed galaxies (cf. Table0. Of these, two objects are MCs, 
i.e. they had originally been misclassified as stars. (The third MC in 
this region is 'AGN-dominated' and will not be considered here.) 

The goal of this Section is now to use the MCs in the all-object 
region to answer the following two questions: (i) Ho w reliable is 
the MGC's high SB selection limit (as determined bv lDriver et alJ 
2005), not just in the all-object region but for the survey as a whole? 
(ii) How many galaxies beyond this limit were missed by the MGC? 
In other words we wish to determine the MGC's incompleteness 
due to misclassification as a function of SB (or size, see below). 
Since the number of misclassified galaxies must depend to some 
extent on seeing, this task is complicated, at least in principle, by 
the fact that the typical seeing of the all-object region is different 
from that of the MGC as a whole. Although the difference is only 
10 per cent we have no way of knowing a priori whether it is im- 
portant or not. 

We begin by recalling that the MGC's high SB limit is in fact 
a minimum size limit (see Appendix A of iDriver et a l. 2005). We 
use the half-light radius, re (in arcsec), as our size measure, which 
is defined as the semi-major axis of the ellipse containing half of 
the object's total flux (as measured by SEXTRACTOR'sBEST mag- 
nitude). In Fig. II 11 a) we show the distribution of objects from the 
all-object region in the r^~BuGc plane. The grey shaded areas show 
the various selection limits: the imposed bright and faint magni- 
tude limits of 16 and 20 mag respectively (see Section l2.2.1> . the 
limiting SB and the maximum and minimum size limits. We deter- 
mined the latter three limits by simulating galaxies over the whole 
re-BuGc plane using the image characteristics of the real MGC 
data, and extractin g the objects in the same manner as the real data 
(see Appendix A of lDriver et alj2005h . Here we are only concerned 
with the minimum size limit, rmin- 

Driver et al. 1 2005) defined rmin as the minimum re at which a 
galaxy can still be reliably distinguished from stars. To derive rmin 
they reasoned as follows: the ability to distinguish a galaxy from 
stars depends primarily not on the absolute value of the galaxy's 
re but rather on the distance of the galaxy to the stellar locus in 
re-space, where the scale of that distance is set by the width of 
the stellar re distribution. Obviously, the observed re values of 
stars and small galaxies are a ffected by se eing, and hence rmin 
must be a function of seeing. iDriver et alj observed that for the 
MGC the median r^ value of stars is given by 0.6 F, where F is 
the seeing FWHM, and that the width of the stellar re distribu- 
tion is independent of seeing. These simple considerations imply 
rmin ~ 0.6 F + A, and using the simulations they determined 
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Figure 11. (a) Distribution of objects from the all-object region in the observed half-light radius-magnitude plane. Stars and QSOs are marked by small black 
points, galaxies by green open circles and misclassified compact galaxies by large solid red dots. Grey shaded areas show the observational selection limits 
discussed in the text. The seeing dependence of the minimum size hmit is indicated by different levels of grey shading. We note in passing that the low SB and 
maximum size limits are not the absolute detection Umits but rather the limits to which accurate photometry is possible. The slight increase of the stars' r^ at 
the brightest magnitudes is due to the onset of flooding, (b) Same as (a) but replacing re with p = re — 0.6 F, i.e. the difference between an object's re and 
the median re of stars observed in the same seeing. 



A = 0.31 arcsec. The seeing dependence of r^in is represented 
in Fig. llir a) by using different levels of grey shading for the best, 
median and worst seeing encountered in the all-object region. 

The above also implies that p — re—O.GFisa useful measure 
of an object's similarity to stars. In Fig. lllf b') we plot the distribu- 
tion of galaxies and stars in the p-Bucc plane. The transformation 
re —> p removes the seeing dependence of the stellar locus [cf. 
panel (a)], aligning all stars around p = while preserving the 
width of the stellar re distribution. The minimum size limit is now 
simply given by pmin = 0.31 arcsec and is also seeing indepen- 
dent. Note however that for a galaxy p is not seeing independent. 
While a galaxy moves downwards in Fig. llir a') when observed un- 
der worse seeing conditions, it moves upwards in panel (b). 

From Fig. llir b) we can see that the two MCs lie near pmin 
and other galaxies, which were correctly identified as such, exist 
at similar p values, indicating that the probability of misclassifying 
such galaxies is not 1. Both MCs lie beyond the minimum size 
limit, resulting in 100 per cent completeness within the selection 
limits, while the completeness at p < pmin is ~50 per cent (two 
out of the four galaxies in this range were misclassified). 

We would now like to estimate the probability of misclassi- 
fying a galaxy and the resulting incompleteness for the full MGC 
more quantitatively. First, we define some notation. In the follow- 
ing A'g and N„c denote the numbers of galaxies that were ac- 
tually identified as such and of those that were misclassified as 
stars respectively. Hence the true number of galaxies is given by 
A'o = A'mc + A^g- The superscripts 'AO' and 'MGC refer to objects 
in the all-object region and in the full survey region respectively. 

Now consider the binomial probability, pg, that a galaxy is 
morphologically misclassified as a star from the imaging data by 
the procedure described in Section 1211 It follows from the dis- 
cussion of the minimum size limit above that pg should only be 
a function of p and that Pg{p) is the same for any observation. The 
number of galaxies that are actually misclassified in a given obser- 
vation is then determined by the distribution of galaxies in p, nc{p), 
which in turn depends on the seeing. We would like to estimate the 



number of misclassified galaxies in the MGC with p larger than 
some piim: 

/■OO /"OC 

Kr(piim) = / rC:^{p)dp= pg(pX'°^(p)dp, (I) 

•'Plim •'Plim 

where we need to measure pg (p) from the all-object region. Ideally, 
we would estimate it as 



M 



ni°{p) 



(2) 



in small bins of p. However, from Fig. II li b) it is clear that the 
interesting region of p near pmin is not populated densely enough 
to do this. We adopt two approaches to deal with this. 

4.1 Parameterising pg (p) 

In the following we will assume a functional form for Pg(p) and 
use the data to constrain the model's parameter(s). Which func- 
tional form to use? Clearly, Pg(p) should rise monotonically from 
at p S> pmin to 1 at p > 0. Furthermore, given the paucity of 
data, the model should have at most two parameters. In general, 
these two parameters will characterise in some sense the 'position' 
and steepness of pg(p). Constructing a one-parameter model will 
require fixing one of these at some a priori value. The only choice 
that is not completely arbitrary seems to be Pg(0) = 1. 

Using these general considerations as a guideline we now 
choose the following four models: 
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We have chosen three different one-parameter models in order to 
be able to test the sensitivity of the results to the precise functional 
form. In each case the free parameter (c, a, uj) controls the steep- 
ness with which pg declines to 0. In contrast we have chosen only 
one two-parameter model (the error function) because, as we shall 
see below, such models are already over-fitting the sparse data. 

We now fit these models to the data from the all-object region 
using maximum likelihood. The likelihood function is given by 



^ « jjpgCpo n [1 ~pg(^i)]' 



(7) 



j=i 



where the first product runs over all MCs and the second prod- 
uct over all other galaxies in the all-object region. Maximising this 
function with respect to the model parameters yields the results 
shown in panels (a)-(b) of Fie. 1121 where the grey-shaded regions 
indicate 90 per cent confidence ranges. 

Due to its additional free parameter the erf model behaves 
quite differently from the other models. It rises quite steeply just 
below pmin to accommodate the observed misclassified fraction of 
~50 per cent in this region (cf. Fig. II lb ) and predicts that essen- 
tially all galaxies with p < 0.18 arcsec should be misclassified. 
The other three models exhibit a more gentle rise, with the cosine 
and gaussian models resulting in very similar pg curves. The small- 
ness of the errors on pg for small and large p values is of course a 
consequence of having assumed a specific functional form. 

We now estimate the number of misclassified galaxies in the 
full MGC as 
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and define the incompleteness of galaxies with p > 
misclassification as 



Pii, 



(8) 



due to 
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(9) 



N^°^{piini) is shown in panel (c) of Fig. 1121 Recall that accord- 
ing to the erf model essentially all galaxies with p < 0.18 arcsec 
should have been misclassified as stars. However, from Fig. ll2r c) 
we can see that a few galaxies with p < 0.18 arcsec were in fact 
identified in the full survey. This causes the erf model to predict an 
infinite number of misclassified galaxies. This unrealistic result is 
of course due to the paucity of data in the all-object region com- 
pared to the number of free parameters of the erf model, which is 
also reflected by the large error on Pg'' shown in in Fig. ll2r b). We 
will not consider this model any further. 

Note also that the few observed galaxies at small p values also 
justify our choice of pg(0) = 1 for the one-parameter models. 
Moving the pg = 1 point to a larger p value would have caused 
the same problem as above. 

The incompleteness resulting from the linear, gaussian and co- 
sine models is shown in panel (d) of Fig. 1121 For a given model we 
can now read off the numbers that were the goal of this Section: 
(i) the incompleteness of galaxies within the selection limit, i.e. 
at Piim = Pmin w herc the MGC was assumed to be complete by 
iDriver et alJ i2005l) and (ii) the MGC's overall incompleteness. We 
summarise these values in columns 2 and 4 of Table|4|respectively. 
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Figure 12. (a) Best fit models of the binomial probability of misclassifying 
a galaxy as a function of p = r^ — 0.6 F, using the linear, gaussian and 
cosine models of equations |3)-|5} as indicated. The fits are obtained from 
the observed abundance of MCs in the all-object region, where we can be 
certain that all galaxies have been identified. The grey shaded area shows 
the 90 per cent confidence range for the gaussian model, (b) Same as (a) for 
the erf model of equation |5|. (c) Number of observed galaxies (i.e. those 
that were originally identified as such) in the full MGC survey region with 
P > Plim- (d) Estimates of the incompleteness of galaxies with p > pui^ 
due to their misclassification as stars based on the binomial probabilities in 
panel (a) and the number of galaxies in panel (c). The thin red line is the 
incompleteness derived from all known ('real') MCs in the full MGC survey 
region. Since this sample is incomplete (cf. Section l2.2.2l . this line is a strict 
lower limit to the true incompleteness. The grey shaded area shows the 90 
per cent confidence range for the gaussian model and the vertical dashed 
line marks the minimum size limit for the reliable detection of galaxies 
adopted bv lPriver et all <2005h . 



We also show in column 6 the incompleteness of galaxies beyond 
the selection limit. 

From Fig. I12r d) we can see that the differences between the 
three models are smaller than the random errors (shown only for 
the gaussian model by the grey shaded area). Hence, given the size 
of the errors, the results do not depend sensitively on the precise 
shape of the assumed functional form of pg. 

Nevertheless, the incompleteness estimate provided by the lin- 
ear model is somewhat lower than that of the other two models and 
may in fact be too low. In Fig. ll2r d) we plot as a thin red line the in- 
completeness derived from all known 'real' MCs in the MGC (47 in 
total), i.e. those in the all-object region as well as those discovered 
through our additional spectroscopy of stellar objects described in 
Section 12.2.21 Since this sample is itself incomplete, the incom- 
pleteness derived from it must be considered a strict lower limit to 
the true incompleteness. The estimate from the linear model traces 
this limit almost exactly. The cosine model, which gives the highest 
incompleteness estimates, is most consistent with the lower bound- 
ary. We summarise the lower limit on the incompleteness in the last 
line of Tabled 

Note the tail of MCs at large p values identified in the addi- 
tional data. There are a total of 10 MCs with p > pmin, however 
the tail consists mostly of six objects at p > 0.45 arcsec. Their 
large p values are due to re measurement errors caused by slightly 



© 2006 RAS, MNRAS 000,[T][T2 



14 J. Liske et al. 



Table 4. Estimates of the MGC's incompleteness due to misclassification of galaxies as stars (16 5C Bmgc < 20 mag). 



Pmin < P 

hi<r Ri{<r 

(per cent) 



< p 

/g«)" Ri{<r 

(per cent) 



From Section l4m using different models: 
Linear 0.05(0.51) (0.41) 

Gaussian 0.15(0.66) 0.05(0.56) 

Cosine 0.07(0.51) (0.41) 

From Section lT2l based on different objects: 
galaxies 0(0.52) (0.42) 

stellar objects 0(0.93) (0.83) 

visually inspected 0(0.99) (0.90) 

Area 0(0.82) (0.72) 



0.49(1.28) 0.02(0.81) 
0.86(2.01) 0.39(1.54) 
1.05(2.31) 0.58(1.85) 



0.45(1.40) (0.95) 

0.65(1.98) 0.19(1.52) 

1.07(2.94) 0.61(2.49) 

0.54(1.69) 0.07(1.23) 



From SectionIO 0.14 
Lower limif^ 0.10 



0.04 



0.61 
0.47 



0.15 



"^Upper limit at 95 per cent confidence. 

''Remaining incompleteness after accounting for 'real' MCs already known. 

'^Derived from all known 'real' MCs in the MGC. 



< p < Pinin 

(per cent) 



36.0(50.5) 5.2(26.7) 
48.1(64.2) 23.2(47.0) 
56.1(70.5) 35.0(56.4) 



35.8(52.8) 6.6(31.3) 

44.9(57.4) 19.8(38.0) 

57.1(71.5) 37.6(58.6) 

41.4(53.8) 13.2(31.6) 



38.6 
32.5 



9.0 



erroneous SEXTRACTOR object ellipses which in turn are due to 
nearby bright objects in four of the six cases. The one-parameter 
models are not equipped to handle a non-zero tail of objects at large 
p. The gaussian is best suited, but even for this model the incom- 
pleteness derived from these objects eventually exceeds even the 
upper end of the 90 per cent confidence range. This highlights that 
the derived errors on the incompleteness estimates do not include 
the shortcomings of the chosen models. 

In columns 3, 5 and 7 of Table |4] we list the MGC's re- 
maining incompleteness, 7?g(piim), after accounting for those MCs 
that have already been discovered. Specifically, i?g is calculated 
by applying equation J9) after having subtracted the known 'real' 
MCs from the number of MCs predicted from the all-object re- 
gion (N"^"^) in the numerator. If the number of known MCs exceeds 
the predicted number (see discussion above) we only list an upper 
limit. 



4.2 Assuming T^ = r„ac 

Recall that the formalism of the previous Section was necessary be- 
cause of the different seeing in the all-object region and the MGC 
as a whole. In this Section we will simply ignore this difference. 
Since in fact Fag < Fmcc, this assumption introduces a bias to- 
wards lower incompleteness. 

Ignoring the seeing difference means that we can assume that 
the p distribution of galaxies in the all-object region is the same as 
in the full MGC (apart from the normalisation): 



n°{p) 
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(10) 



where A'^ = J^ n(/a) dp. Substituting into equations ^2} and Q 
we find for the number of misclassified galaxies in the full MGC: 



^^'"^(Plim) = A''Mc(Plim) 



k: 



m' 



(11) 



i.e. we simply scale the number of MCs in the all-object region by 
the ratio of the total numbers of galaxies in the full MGC and the 
all-object region. For simplicity and so that we do not have to make 
any reference to objects with p < pn^ we prefer to replace this 
ratio with N^°'^ {pih-n)/N^° (piim) which is valid under the assump- 



tion of having the same seeing in the all-object region and in the full 
MGC. In analogy to equation J2j we now define the 'cumulative' 
binomial probability as 



(12) 



- / N A'^Mc(Plim) 

Pg(Plim = ^TT— ^ T 

A*.; (Plim) 

and we rewrite equation <IH as 

^T^CPlim) = Pg(piim)A''o'°^(Plim) 

= Pg(Plim) [A^Mr(plim) + AT''(Plim)] 
^ Pg(piim) 

1-Pg(pim, 

analogous to equation Js}. 

Given the assumption Fao ~ Fmgc we can consider several 
other ways of estimating ACc '^ : 

First, instead of scaling the number of MCs by the total num- 
ber of galaxies as in equation <1I> . we can simply scale by the ob- 
served areas on the sky, A, thus making no reference to any objects 
other than the MCs: 



-iVg°"(piim), 



(13) 



AT^lPlim) = A'',;;c(piim) 



A^ 



(14) 



Next, from an a posteriori point of view it is interesting to ask: 
given an object morphologically classified as stellar, what is the 
binomial probability, ps, that it is actually a galaxy? We now have 



Ps(plin 

and 



N:°{pun.) 
iVs"(piim) 



K°''{plim) =Ps(piim) iVs"°^(piim), 



(15) 



(16) 



where A'^s is the number of objects morphologically classified as 
stellar. This includes stars, QSOs and MCs. Note that the pro- 
cess of testing whether a morphologically stellar object is in fact 
a galaxy is not really a binomial process because the outcome is 
pre-determined. In fact, compared to the previous two estimates of 
this Section we expect equation <16> to result in a higher incom- 
pleteness because the average stellar density in the full survey is 
higher than in the all- object region (due to different mean Galactic 
latitudes, cf. fig. 11 of|ListeetjL|20o3). 
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Figure 13. (a) Cumulative binomial probability of misclassifying a galaxy 
with p > piim as a star, (b) Cumulative binomial probability that an object 
classified as stellar and with p > p^y^ is actually a galaxy, (c) Cumulative 
binomial probability that an object classified as stellar by visual inspection 
and with p > pi^^^ is actually a galaxy, (d) Number of objects in the full 
MGC survey region with p > Piim- 'g' and 's' refer to objects originally 
classified as galaxies and stars, while 'v' denotes that subset of 's' which 
was classified by eye. (e) Estimates of the incompleteness of galaxies with 
P > Plim due to their misclassification as stars based on the probabilities 
in panels (a)-(c) (which were derived from the all-object region) and the 
numbers of objects in panel (d) as indicated. The thin red line is the in- 
completeness derived from all known ('real') MCs in the full MGC survey 
region, and gives a lower limit on the true incompleteness. The grey shaded 
areas show 90 per cent confidence ranges and the vertical dashed line marks 
the minimum size limit adopted bv .Driver et aL l2QQ5-) . 



Finally, we note that both of the MCs in the all-object region 
have stellaricity < 0.98 (cf. Fig.|4} and therefore their original clas- 
sification as stars was the result of visual inspection (see Section 
12. H . Hence we can gain yet another estimate of the incompleteness 
by substituting A^s above with the respective number of stellar ob- 
jects classified as such by visual inspection, N^. However, we must 
expect this estimate to return an even higher incompleteness than 
the previous one because it suffers an additional bias: the fraction 
of stars classified visually is higher in the full survey than in the 
all-object region because of the worse seeing. 

In Fig. ll3l we show the above cumulative probabilities derived 
from the all-object region (panels a-c), the appropriate numbers of 
objects in the full survey region (d) and the resulting incomplete- 
ness estimates (e). The latter are summarised in Table|4] 

Given the errors, the derived 7g values of this Section are rea- 
sonably consistent with each other. This implies that the effects of 
the differences in the stellar density and visually classified fraction 
between the all-object region and the full MGC are not dramatic. 
Nevertheless, they are detectable, and we have Jg < /| < 7g as 
expected. We also find that Jg is smaller than the incompleteness 
estimates from the previous Section and we attribute this to the 
seeing difference between the all-object region and the full MGC. 
However, compared to the size of the random errors the systematic 
shift is again small. 

Note that both MCs in the all-object region have p < pmin- 



Therefore the all-object region predicts /g(piim = Pmin) = 
which is of course below the lower limit set by the sample of al- 
ready known MCs in the full MGC (again shown as a thin red line 
in Fig. 113b ). Hence, in Table |4| we only list upper limits for the 
residual incompleteness Rg at pmin- 

A major difference between Figs. ll3r e) and ll2r d) is the size of 
the errors for large pum- The error of, say, 7g(0.6) in Fig. llST e) is 
simply set by the observation that none out of A'^°'^(0.6) galaxies 
in the all-object region had been misclassified, while the smallness 
of the error on Jl''"^ (0.6) in Fig. I12r d') is due to the additional 
assumption of a specific function forpg. 

Even though /g(piim) should be a monotonic function, 7| in 
Fig. lOr e) exhibits a small decline from pum ~ 0.12 to arcsec. 
This reflects a small misalignment between the stellar p distribu- 
tions, ns(p), in the all-object region and the full survey. In any 
case, since the fluctuation is much smaller than the errors, it has 
little effect on the result. 



4.3 Correcting the MC sample for incompleteness 

In the bottom panels of Figs. 1121 and \^ we have compared the 
various 7g estimates predicted from the complete sample of MCs 
in the all-object region with the lower limit on 7g derived from 
the incomplete sample of all known 'real' MCs in the full MGC. 
Almost all of the estimates are within a factor of 2 of the lower 
limit, suggesting that our additional observations of stellar objects 
have probed at least part of that region of observational parameter 
space in which MCs exist. 

As a cross-check we will now apply a rudimentary correction 
to the number of MCs discovered outside of the all-object region 
(see Section l2.2.2> to see whether we recover a similar overall in- 
completeness as in the previous Sections. We perform the correc- 
tion by dividing the four-dimensional parameter space spanned by 
TJmgc, {u — g), {g — z) and SDSS-DRl morphological classifica- 
tion into five separate regions. We then determine the spectroscopic 
incompleteness of stellar objects in each of these regions and ap- 
ply this factor to the number of MCs discovered in them, implic- 
itly assuming that each region was sampled homogeneously. Since 
MGCz contributed most of the spectra in the 'additional' sample, 
we follow the MGCz target selection described in Section l2.2.2l in 
defining the above regions. The result of applying this correction 
is that the MGC's overall incompleteness is 0.61 per cent, in rea- 
sonable agreement with the models of Section l4m (cf. column 4 of 
Table|4j. The 95 per cent confidence upper limit on this number is 
1.74 per cent but there is an unknown additional uncertainty due to 
our use of an approximate selection function. Hence the estimate 
above is much less reliable than those of the previous Sections and 
we do not quote any upper limits in Table|4] 



5 LUMINOSITY FUNCTION AND DENSITY 
INCLUDING COMPACT GALAXIES 

In the previous Section we have seen that the standard MGC galaxy 
catalogue is missing up to 64 per cent of galaxies with r^ < rmin- 
However, since this translates to only 2 per cent of all galaxies (cf. 
columns 6 and 4 of Table|4| gaussian model) one might be tempted 
to conclude that MCs and other compact galaxies are entirely neg- 
ligible with respect to the galaxy population as a whole. However, 
since the raw number of non-compact galaxies is dominated by lu- 
minous objects and since compact galaxies are preferentially faint 
(cf . right panel of Fig.|9}, the above impression may be misleading. 
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Clearly, we need to consider the issue as a function of luminosity 
before we can reach a final conclusion regarding the relevance of 
compact galaxies and their misclassification. The goal of this Sec- 
tion is therefore to evaluate the contribution of compact galaxies to 
the local galaxy LF. 

In the following we construct LFs using a variant of the bivari- 
ate ste p-wise maximum likelihood (SWML) method of lDriver et alJ 
yOOa). We impose the same apparent magnitude, low-SB and red- 
shift limits: 13 < Bmoc < 20 mag, re < 15 arcsec, /icff < 
25.25 mag arcsec^^ and 0.013 < z < 0.18, but we do not impose 
a minimum size limit. In our context of compact galaxies we can 
hence regard this metho d as being equivalent to a standard mono- 
variate SWML analysis J Efstathiou. Ellis & Petersonll988l) . where 
all SB selection effects are ignored. The only difference is that we 
are in fact accounting for the MGC's low-SB limits. 

The application of the above limits leaves us with 7803 non- 
misclassifi ed galaxies. T o convert apparent to absolute magnitudes 
we use the lDriver et alj global evolutionary correction and individ- 
ual iiT-corrections, where the former is given by E{z) = —0.75 x 
2.51og(l + z). 

We begin now by first constructing the LF of non-compact 
galaxies only, i.e. we exclude all MCs as well as all non- 
misclassified galaxies with re < rmin from the sample. This is 
the sample that would result from a severe high-SB selection ef- 
fect (in the sense that every galaxy below rmin is missed) and 
its LF will obviously be an under-estimate of the true LF. In the 
following we will refer to this as the non-compact LF (NCLF). 
In Fig. 1 141 we show the NCLF as black solid squares, where we 
have n ormalised all data points by an arbitrarily chosen lSchechten 
il97d) function with parameters: Mg — 5 log ft = —19.63 mag, 
(j)* = 0.0174 /i^ Mpc"^ and a = -1.16. We show only poisson 
errors since we will only be comparing various estimates of the LF 
derived from essentially the same volume. 

Next we add in the non-misclassified compact galaxies with 
re < rmin (CGs), wMch somewhat raises the LF at the faint end 
as expected (blue open squares). However, we know that these CGs 
are ~50 per cent incomplete, and so we are still under-estimating 
the true LF. This is the LF one obtains from the MGC data if the 
issue of high-SB selection effects is simply ignored. 

The brown open circles in Fig. ^| show the result of also 
adding in the known 'real' MCs. Since not all MCs were actually 
discovered, this sample is still incomplete and we are still under- 
estimating the true LF. 

Finally, we derive a LF estimate using an incompleteness cor- 
rection: in Section l4m we postulated that the classification of galax- 
ies is a random process, where the binomial probability of misclas- 
sifying a galaxy as a star, pg, depends only on the galaxy's p = 
re — 0.6 r. Hence we can recover the total number of galaxies by 
simply weighting each non-misclassified galaxy by [1 — pg(p)]~^. 
The result is shown as red solid dots in Fig. ll4l where we have used 
the gaussian estimate of pg (cf. Section 1411 . Using the cosine or 
linear estimates instead results in the LFs marked by the upper and 
lower ends of the light red vertical boxes respectively. The error- 
bars protruding from these boxes show the uncertainties due to the 
90 percentile errors on the cosine and linear model parameters. 

We list the parameters of the best fitting Schechter functions 
of the above LFs in Table|5] 

For comparison we also plot in Fig. ll4l as green open triangles 
the MGC LF as derived by using the fully bivariate SWML method 
of lDriver et al. (2005), including the minimum size limit. In this 
case we are using the same sample as for the NCLF, i.e. the MCs 
and CGs are excluded, but now the rmin selection limit is properly 
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Figure 14. Various estimates of the MGC's luminosity function as indicated 
and explained in the text. All estimates are normalised by an arbitrary refer- 
ence Schechter function (RSF), which has Alg — 5 log h = —19.63 mag, 
4>* = 0.0174 h^ Mpc^"^ and a = —1.16. The grey shaded area shows a 
±1(T range around the RSF that is typical of the Schechter function fits of 
the various LFs. The LFs are derived using an essentially standard SWML 
method, e xcept the one labelle d 'BSWML' where we use the full bivaiiate 
method of lDriver et alJ i20O.'5ft . 'CGs' refers to non-misclassified compact 
galaxies. The errors on the NCLF are poissonian only. The errors on the 
incompleteness corrected LF reflect the uncertainties due to the choice of 
the incompleteness model (boxes) and the models' parameters (en'orbars). 
The black dotted and red dashed lines show the best fit Schechter function 
for the NCLF and the incompleteness corrected LF respectively. Some data 
points are offset horizontally for clarity. 



taken into account in the SWML analysis by appropriately reduc- 
ing each galaxy's observable parameter space in the luminosity- 
SB plane. This is equivalent to reducing the volume over which 
a galaxy could have been observed, and hence the resulting LF is 
somewhat higher than in the standard NCLF case. 

Examining Fig. ll4l we first of all notice that measurable differ- 
ences between the various LF estimates are confined to the regime 
AIb -SJ — 18 mag. As mentioned above, this is due to the com- 
bined effect of rmin and the luminosity-size relation of galaxies as 
discussed in connection with Fig. llQI 

Secondly, comparing the red dots to the blue open squares, 
it seems that the incompleteness of compact galaxies due to mis- 
classification does not affect the LF significantly: for almost all 
points the differences are smaller than the la poisson error. In 
this sense even the effect of the total compact galaxy population is 
only marginally significant (comparing the red dots with the black 
squares). However, the impact of the compact galaxies is obviously 
systematic and hence more significant when considering the faint 
end of the LF as a whole: the overall effect of the compact galax- 
ies is to steepen the faint-end slope a by 0.05, which is a 2.5a 
effect (cf. Table|5j, while the misclassified galaxies alone steepen 
a by 0.03, which is still marginally significant. The steepening in- 
creases to 0.05 if we use the upper end of the 90 percentile confi- 
dence range on p|''"''(p), and to 0.06 if we use the most extreme 
incompleteness correction (cf. last line of Table|5j. 

To further illustrate this point, we directly compare the LFs 
of compact and non-compact galaxies in absolute terms in Fig. ll5l 
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Figure 15. LFs of compact and non-compact galaxies as indicated. The 
black line and grey region show the Schechter function that best fits the 
NCLF, and ztlcr range. All en'orbars have the same meaning as in Fig. 1141 



Although there are ~2 orders of magnitude fewer compact galaxies 
than non-compact ones at bright magnitudes, they make up ~10 per 
cent of all galaxies in the range — 17 < Mb < — 14 mag, and the 
misclassified galaxies alone contribute ~6 per cent in this range. 

Finally, comparing the incompleteness corrected LF to the 
fully bivariate SWML estimate in Fig. ll4i rgreen triangles) we find 
that the two agree very well for almost all points. Put simply, the 
first method corrects for missing objects while the second method 
corrects for missing parameter space (or volume), and it is reas- 
suring that the two methods yield the same result with no obvious 
systematic bias. 

Since the luminosity density, j, is doininated by the bright end 
of the LF it is only marginally affected by the compact galaxies. For 
a given LF we calculate the luminosity density from the parameters 
of the best fit Schechter function as j = </>* L*g r(a + 2). From 
the last column of Table |5| we can see that the inclusion of com- 
pact galaxies increases j by 3.5 per cent, while the misclassified 
galaxies alone are responsible for an increase of only 2 per cent. 



6 CONCLUSIONS 

We have explored the very high surface brightness (SB) regime of 
the Millennium Galaxy Catalogue (MGC) to assess the impact of 
compact galaxies on the field galaxy luminosity function (LF) in the 
local (z « 0.1) Universe. We have observationally defined 'com- 
pact galaxies' as objects whose half light-radii are so small that 
they cannot reliably be distinguished from stars in the MGC using 
SEXTRACTOR and/or visual examination (i.e. re < 1 arcsec). 

We have studied the incompleteness of the standard MGC 
galaxy catalogue (Bmgc < 20 mag) due to the misclassification 
of compact galaxies as stars using the 1.14 deg^ all-object sub- 
region of the MGC, where we have spectroscopically identified all 
galaxies in the range 16 < Bmgc < 20 mag. We find: 

1. Within th e MGC's minimum size limit (as derived by 

iDriver et alj|2005l) only 0.15 per cent of galaxies are missed due 
to their compactness, with a 95 per cent upper limit of 0.66 per 



cent. Hence we verify IPriver et al.i s assumption that the MGC is 
complete within its minimum size limit. 

2. Beyond the minimum size limit ~50 per cent of galaxies 
are misclassified as stars. 

3. However, with respect to the total galaxy population this 
translates to an incompleteness of only 1.05 per cent of all galaxies, 
with a 95 per cent upper limit of 2.31 per cent. 

Our incompleteness is a fa ctor of 2 .6 lower than the (2.8 ± 
1.6) per cent found by Drinkw ater et alj (l999) for galaxies with 
16.5 ^ &j ^ 19.7 mag in the Fornax Cluster Spectroscopic Sur- 
vey (FCSS). This is presumably due to the higher resolution of the 
MGC compared to the UK Schmidt photogra phic data from which 
the FCSS was selected. The null-result of Morton et alj J1985h 
on the other hand can be explained by their small survey area of 
0.31 deg^ 

The incompleteness values above ignore the fact that a total 
of 47 misclassified compact galaxies (MCs) have already been dis- 
covered through additional spectroscopy of morphologically stellar 
objects. Accounting for these objects reduces the overall incom- 
pleteness to just 0.58 per cent. 

Using this extended, albeit incomplete sample of 47 objects 
we have studied the nature of the MCs. Over 90 per cent of them are 
emission line galaxies (similar to the fraction of Dri nkwater et alJ 
Il999) . but we also find three E-l-A galaxies. Although some E-l-As 
show clear signs of interaction iZabludoff et al. 199a: lYang et alJ 
|2004) they are generally found to be bulge-dominated, spheroidal 
systems JQuintero et al. 2004; Yang et al. 2004; Blake et al. 200^) 
with bright compact cores (Goto 2005) and higher central SB than 
normal bulge-dominated galaxies lOuintero et al. 2004: Yang et aU 
12004^ . Hence their appearance among the MCs is not too surpris- 
ing. 

An analysis of emission line ratios revealed that four of the 
emission line galaxies harbour a type 2 AGN. Even though early 
studies of Seyfert galaxies seemed to sugg est that they were mostly 
hosted by spiral galaxies I Ada mslll977h . recent results from the 
SDSS have shown that the surface mass density distribution of type 
2 AGN hosts is very sim ilar to that of normal early-type galaxies 
teauffmann et all2003ah . Also, their concentration index distribu- 
tion extends to very high values and so we conclude that our com- 
pact type 2s are not unusual. 

Blue compact galaxies, although ill-defined, are the most 
common and w idely studied class of compact galaxies (see 
iKunth & Ostlinl l2000 for a review) and we find that 83 per cent 
of our MCs are blue, star-forming galaxies with strong nebular 
emission lines. We note that these galaxies generally have Mb > 
— 19 mag and are hence fa inter than the luminous blue compact 
galaxies (e.g. IPhillips et aL 1997: Werk et al. 2004^ but overlai 
with the blue compac t dwarfs (BCDs; e.g. lThuan & Marti j|l98ll 
iGil de Paz et al.'2003!). 

T he 'blue spheroids' of lEllis et alJi2005l) (see also lCross et alj 
I2OO4I IPriver et alJl200a) are another type of blue, star-forming, 
sub-L* galaxy. However, in contrast to the BCDs, which often 
have disturbed or irregular morphologies iCairos et alJl2001l) . the 
blue spheroids are very smooth, with concentration and asymme- 
try indeces similar to normal ellipticals. Since our MCs are only 
barely resolved we have not been able to reliably fit their SB pro- 
files. Hence, without higher resolution imaging it is not possible to 
decide whether the MCs are more similar to BCDs or a high SB 
version of the blue spheroids. 

The faintness of the MCs, which is essentially due to the 
luminosity-size relation of galaxies, is the cause of their differen- 
tial effect on the local galaxy luminosity function: our main re- 
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Table 5. Schechter function parameters for the LFs of Fig. fT?! 



Sample 



(IQ-^h^ Mpc-3) 



M^ -5logh 
(mag) 



(lO^hLQ Mpc-3) 



NCLF 


1.73 ± 0.07 


-19.63 ±0.04 


-1.14 ±0.02 


1.93 ±0.13 


NCLF+CGs 


1.72 


-19.64 


-1.16 


1.97 


NCLF+CGs+MCs 


1.70 


-19.64 


-1.17 


1.96 


NCLF+CGs+inc. correction 


1.67 


-19.66 


-1.19 


2.00 


NCLF+CGs+extreme inc. cor." 


1.64 


-19.68 


-1.22 


2.06 



"Using the upper end of the 90 percentile confidence range of the cosine estimate of Pg{p). 



suit is that even in the relatively high-resolution MGC the mis- 
classification of compact galaxies as stars causes the LF to be sys- 
tematically underestimated at the faint end by Aa = O.OSIqqj, 
while the bright end remains unaffected. The faint-end bias is 
comparable in size to the poisson errors of the MGC and mis- 
classified galaxies contribute ~6 per cent to the LF in the range 
— 17 < Mb < — 14 mag. In contrast, the luminosity density is 
only affected at the ~2 per cent level. 

Clearly, we have not uncovered a large population of galaxies 
'hiding' among the stars. On the other hand, high-SB selection ef- 
fects are not completely negligible either and hence should be taken 
into account when constructing LFs. This is 'naturally' achieved 
by first constructing the bivariate space density of galaxies in ei- 
ther the luminosity-SB or luminosity-size plane using a bivariate 
SWML method, and then integrating over the second parameter to 
obtain the LF I Driver et al. 2005). Here we have confirmed that this 
method recovers a result very close to an incompleteness corrected 
LF 

Our result regarding the high SB end of the galaxy distribu- 
tion should be viewed together with the situation at the low SB 
end. While l uminous gala xies are not affected by low SB selection 
effects (e.g. iDriver et al.l l2005). faint galaxies are. Blanton e t alj 
i2005h recently presented an SDSS LF corrected for low SB in- 
completeness at /ieff.r ^ 23 mag arcsec"'^ and concluded that the 
correction resulted in a significant steepening of the faint end of the 
LF. Indeed, at Mb ^ — 16 mag the galaxy population extends to 
even the deeper M GC's SB limit of /xum = 25.25 mag arcsec"^ 
JDriver et alj|2005h . Hence, even in the MGC the faint end of the 
LF is significantly affected by both low and high SB selection ef- 
fects, despite the fact that the MGC is both deeper and of higher 
resolution than any previous major ground-based survey. Clearly, 
to obtain a complete view of the dwarf galaxy population requires 
still deeper and higher resolution imaging (soon to be delivered by 
the VLT Survey Telescope) as well as spectroscopy on 8-m class 
telescopes. 
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